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TITLE OF THE INVENTION 

POLYNUCLEOTIDE AND POLYPEPTIDE SEQUENCES OF CANINE CATHEPSIN S 
BACKGROUND OF THE INVENTION 

Cathepsin S (EC 3.4.22.27) is a cysteine protease of the papain family found primarily in 
lysosomes (D. Bromme; M.E. McGrath, "High Level Expression and Crystallization of 
Recombinant Human Cathepsin S'\ Protein Science (1996) 5:789-791). Cathepsin S (CatS) is 
a cysteine protease expressed in lymphatic tissues. It is has been identified as playing a major 
role in invariant chain proteolysis which is a prerequisite for peptide loading of MHC class II 
(Riese et al. (1996) Immunity 4:357). It has 50 to 60% identity to cathepsins L and K, but differs 
in that it has a broad pH optimum that extends to alkaline pH. Inhibitors have been shown in 
animal models to modulate antigen presentation and are effective in an asthma model (Riese et 
al, J. Clin. Invest. (1998) 101:2351). Mice deficient in cathepsin S have an impaired ability to 
present exogenous proteins by professional antigen presenting cells (Nakagawa et al, Immunity 
(1999) 10:207; Shi etal, Immunity (1999) 10:197). 

The role of cathepsin S in the immune response is anticipated by its tissue distribution: cathepsin 
S is found primarily in lymphatic tissues, lymph nodes, the spleen, B lymphocytes, and 
macrophages (H. Kirschke, "Cathepsin S", In Handbook of Proteolytic Enzymes (Barrett, 
Rawlings and Woessner (Eds.), San Diego:Academic Press) (1998) Ch. 211: 621-624.). 
Cathepsin S inhibitors have been shown in animal models to modulate antigen presentation and 
are effective in an animal model of asthma (Riese et al., "Cathepsin S Activity Regulates 
Antigen Presentation and Immunity", J. Clin. Invest. (1998) 101:2351-23631 and Sta et al, 
"Cathepsin S Required for Normal MHC Class II Peptide Loading and Germinal Center 
Development", Immunity (1999) 10:197-206). 
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The recognition of antigen-presenting MHC class II molecules by CD4 + T cells is a crucial 
component of the immunological response. Class II molecules, like other trans-membrane 
proteins, are trans-located into the endoplasmic reticulum after synthesis, where they associate 
with a third protein, the invariant chain (Ii). This molecule is a type II trans-membrane protein 
that serves as a class II-specific chaperone which promotes the exit of class Il-Ii complexes from 
the endoplasmic reticulum and prevents class II molecules from binding peptides and unfolded 
proteins in the endoplasmic reticulum and in the secretory pathway. 

A targeting motif in the cytoplasmic tail of Ii directs the complexes from the secretory pathway 
into the endosomal system. Before the MHC class II molecules can present antigen the D must 
be removed. This is accomplished by a series of proteases that break Ii down into small peptides. 
However, an Ii fragment, called class H-associated invariant chain peptide (CLIP), which 
occupies the peptide-binding groove of the class II molecule, is in most cases not spontaneously 
released. The CLIP fragment serves as a substitute peptide that protects the class II binding 
pocket from collapsing both during intracellular transport and after Ii degradation in the 
endosomal system. Binding of antigenic peptides, generated from endocytosed proteins, requires 
an empty, yet open binding site, and therefore CLIP has to be released while the open binding 
site needs to be stabilized to allow the binding of other peptides. Human Leukocyte Antigen 
DM ('HLA-DM') has been well documented to mediate both of these functions, thus promoting 
the binding of antigenic peptides. After acquiring peptides, the class II molecules are transported 
to the cell surface via routes that are largely unknown. 

Blocking the presentation of antigens is a promising way to inhibit the immune response. This 
could be done by disrupting the uptake, the proteolytic processing, or binding to MHC class II 
molecules. Blocking the uptake may be problematic since many different cell types require this 
function. Inhibition of the proteolytic processing of particular antigens may be of use since 
different proteases may be involved in cleaving different antigens, however these proteases are 
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not specific and may lead to other side effects. One way to specifically block the binding to the 
antigens to the MHC class II is to inhibit the proteolysis of the invariant chain. If this is not 
removed then the MHC class II molecules cannot be loaded with peptides, hence blocking Ii 
degradation would decrease antigen presentation to CD4+ T-cells and disrupt the normal immune 
response. 

Mice in which the gene encoding cathepsin S has been knocked out are less susceptible to 
collagen-induced arthritis and their immune systems have an impaired ability to respond to 
antigens CNakagawa et al., "Impaired Invariant Chain Degradation and Antigen Presentation and 
Diminished Collagen-Induced Arthritis in Cathepsin S Null Mice", Immunity (1999) 10:207- 
217). These data demonstrate that compounds that inhibit the proteolytic activity of human 
cathepsin S should find utility in the treatment of chronic autoimmune diseases including, but not 
limited to, lupus, rheumatoid arthritis, and asthma; and have potential utility in modulating the 
immune response to tissue transplantation. 

There are a number of cathepsin S inhibitors reported in the literature, some of which are listed 
below. Certain dipeptidyl nitriles are claimed by Novartis as cathepsin S inhibitors (Altmann et 
al., WO-99/24460) Dipeptidyl vinyl sulfones are claimed by Arris (now Axys) as cysteine 
protease (including cathepsin S) inhibitors in: Palmer et al., US 5,976,858. Certain peptidyl 
sulfonamides are claimed by Arris/Axys as cysteine protease (including cathepsin S) inhibitors 
in: Palmer et al., US 5,776,718 (assigned to Arris, now Axys) and Klaus et al., US 6,030,946. 

Methods of modulating autoimmunity with an agent that modulates cathepsin S activity (e.g. 
proteolysis of the Ii chain, as well as methods of treating a subject having an autoimmune 
disorder, methods of evaluating a treatment for its ability to modulate an immune response) are 
described in WO 99/58153. 
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SUMMARY OF THE INVENTION 

A DNA molecule encoding Canine Cathepsin S has been cloned and characterized and it 
represents a novel nucleotide and amino acid sequence. Using a recombinant expression system, 
functional DNA molecules encoding the canine cathepsin S protease have been isolated. The 
biological and structural properties of these proteins are disclosed, as is the amino acid and 
nucleotide sequence. The recombinant DNA molecules, and portions thereof, are useful for 
isolating homologues of the DNA molecules, identifying and isolating genomic equivalents of 
the DNA molecules, and identifying, detecting or isolating mutant forms of the DNA molecules. 
The recombinant protein is useful to identify modulators of functional Cathepsin S. Modulators 
identified in the assays disclosed herein are useful as therapeutic agents. The canine Cathepsin S 
nucleic acid and polypeptide molecules of the present invention are also useful in studying the 
properties of Cathepsin S modulators, such as in developing compounds intended for 
pharmaceutical use in any species, including but not limited to humans. 

BRIEF DESCRIPTION OF THE DRAWING 

Figure 1: The nucleotide sequence of Canine Cathepsin S is shown. 

Figure 2: The amino acid sequence of Canine Cathepsin S is shown. 

Figure 3: Functional expression of Canine Cathepsin S in recombinant host cells is shown. 
Figure 4, 

Panels A and B: Modulation of Canine Cathepsin S by a known Canine Cathepsin S 
inhibitor is shown. 



Page 4 



ORT-1458 



DETAILED DESCRIPTION 

Definitions 

The term "protein domain" as used herein refers to a region of a protein that may have a 
particular three-dimensional structure that may be independent from the remainder of the protein. 
This structure may maintain a particular activity associated with the domain's function within the 
protein including enzymatic activity, creation of a recognition motif for another molecule, or 
provide necessary structural components for a protein to exist in a particular environment. 
Protein domains are usually evolutionarily conserved regions of proteins, both within a protein 
family and within protein superfamilies that perform similar functions. The term "protein 
superfamily" as used herein refers to proteins whose evolutionary relationship may not be 
entirely established or may be distant by accepted phylogenetic standards, but show a similar 
three dimensional structure or display unique consensus of critical amino acids. The term 
"protein family" as used herein refers to proteins whose evolutionary relationship has been 
established by accepted phylogenic standards. 

The term "fusion protein" as used herein refers to protein constructs that are the result of 
combining multiple protein domains or linker regions for the purpose of gaining the combined 
functions of the domains or linker regions. This is may be accomplished by molecular cloning of 
the nucleotide sequences encoding such domains to produce a new polynucleotide sequence that 
encodes the desired fusion protein. Alternatively, creation of a fusion protein may be 
accomplished by chemically joining two proteins. 

The term "linker region" or "linker domain" or similar such descriptive terms as used herein 
refers to polynucleotide or polypeptide sequence that are used in the construction of a cloning 
vector or fusion protein. Functions of a linker region can include introduction of cloning sites 
into the nucleotide sequence, introduction of a flexible component or space-creating region 
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between two protein domains, or creation of an affinity tag for specific molecule interaction. A 
linker region may be introduced into a fusion protein resulting from choices made during 
polypeptide or nucleotide sequence construction. 

The term "cloning site" or "polycloning site" as used herein refers to a region of the nucleotide 
sequence that has one or more available restriction endonuclease consensus cleavage sequences. 
These nucleotide sequences may be used for a variety of purposes, including but not limited to 
introduction into DNA vectors to create novel fusion proteins, or to introduce specific site- 
directed mutations. It is well known by those of ordinary skill in the art that cloning sites can be 
engineered at a desired location by silent mutations, conserved mutation, or introduction of a 
linker region that contains desired restriction enzyme consensus sequences. It is also well 
known by those of ordinary skill in the art that the precise location of a cloning site can be 
engineered into any location in a nucleotide sequence. 

The term "tag" as used herein refers to an amino acid sequence or a nucleotide sequence that 
encodes an amino acid sequence that facilitates isolation, purification or detection of a protein 
containing the tag. A wide variety of such tags are known to those skilled in the art, and are 
suitable for use in the present invention. Suitable tags include, but are not limited to, HA 
peptide, polyhistidine peptides, biotin / avidin, and other antibody epitope binding sites. 

Isolation of Canine Cathepsin S Nucleic Acid 

The present invention relates to DNA encoding Canine Cathepsin S that was isolated from 
Canine Cathepsin S producing cells. Canine Cathepsin S, as used herein, refers to protein that 
can specifically function as a cysteine protease. 

The complete amino acid sequence of Canine Cathepsin S was not previously known, nor was 
the complete nucleotide sequence encoding Canine Cathepsin S known. It is predicted that a 
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wide variety of cells and cell types will contain the described canine Cathepsin S. Vertebrate 
cells capable of producing Canine Cathepsin S include, but are not limited to spleen cells, bone 
marrow cell and other lymphoid cells such as B cells, dendritic cells, and macrophages. 

Other cells and cell lines may also be suitable for use to isolate Canine Cathepsin S cDNA. 
Selection of suitable cells may be done by screening for Canine Cathepsin S activity in cell 
extracts or in whole cell assays, described herein. Cells that possess Canine Cathepsin S activity 
in any one of these assays may be suitable for the isolation of Canine Cathepsin S DNA or 
mRNA. 

Any of a variety of procedures known in the art may be used to molecularly clone Canine 
Cathepsin S DNA. These methods include, but are not limited to, direct functional expression of 
the Canine Cathepsin S genes following the construction of a Canine Cathepsin S-containing 
cDNA library in an appropriate expression vector system. Another method is to screen Canine 
Cathepsin S-containing cDNA library constructed in a bacteriophage or plasmid shuttle vector 
with a labelled oligonucleotide probe designed from the amino acid sequence of the Canine 
Cathepsin S subunits. An additional method consists of screening a Canine Cathepsin S- 
containing cDNA library constructed in a bacteriophage or plasmid shuttle vector with a partial 
cDNA encoding the Canine Cathepsin S protein. This partial cDNA is obtained by the specific 
PCR amplification of Canine Cathepsin S DNA fragments through the design of degenerate 
oligonucleotide primers from the amino acid sequence of the purified Canine Cathepsin S 
protein. 

Another method is to isolate RNA from Canine Cathepsin S-producing cells and translate the 
RNA into protein via an in vitro or an in vivo translation system. The translation of the RNA 
into a peptide a protein will result in the production of at least a portion of the Canine Cathepsin 
S protein that can be identified by, for example, immunological reactivity with an anti-Canine 
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Cathepsin S antibody or by biological activity of Canine Cathepsin S protein. In this method, 
pools of RNA isolated from Canine Cathepsin S-producing cells can be analyzed for the presence 
of an RNA that encodes at least a portion of the Canine Cathepsin S protein. Further 
fractionation of the RNA pool can be done to purify the Canine Cathepsin S RNA from non- 
Canine Cathepsin S RNA. The peptide or protein produced by this method may be analyzed to 
provide amino acid sequences which in turn are used to provide primers for production of Canine 
Cathepsin S cDNA, or the RNA used for translation can be analyzed to provide nucleotide 
sequences encoding Canine Cathepsin S and produce probes for this production of Canine 
Cathepsin S cDNA. This, method is known in the art and can be found in, for example, T. 
Maniatis, E.F. Fritsch and J. Sambrook in Molecular Cloning: A Laboratory Manual 2 nd Edition , 
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989. 

It is readily apparent to those skilled in the art that other types of libraries, as well as libraries 
constructed from other cells or cell types, may be useful for isolating Canine Cathepsin S- 
encoding DNA. Other types of libraries include, but are not limited to, cDNA libraries derived 
from other cells, from organisms other than Canine Cathepsin S, and genomic DNA libraries that 
include YAC (yeast artificial chromosome) and cosmid libraries. 

It is readily apparent to those skilled in the art that suitable cDNA libraries may be prepared from 
cells or cell lines which have Canine Cathepsin S activity. The selection of cells or cell lines for 
use in preparing a cDNA library to isolate Canine Cathepsin S cDNA may be done by first 
measuring cell associated Canine Cathepsin S activity using the measurement of Canine 
Cathepsin S-associated biological activity or a ligand binding assay. 

Preparation of cDNA libraries can be performed by standard techniques well known in the art. 
Well known cDNA library construction techniques can be found for example, in T. Maniatis, 
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E.F. Fritsch and J. Sambrook in Molecular Cloning: A Laboratory Manual 2 n Edition , Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989. 

It is also readily apparent to those skilled in the art that DNA encoding Canine Cathepsin S may 
also be isolated from a suitable genomic DNA library. Construction of genomic DNA libraries 
can be performed by standard techniques well known in the art. Well known genomic DNA 
library construction techniques can be found in T. Maniatis, E.F. Fritsch and J. Sambrook in 
Molecular Cloning: A Laboratory Manual 2 nd Edition , Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York, 1989. 

In order to clone the Canine Cathepsin S gene by the above methods, the amino acid sequence of 
Canine Cathepsin S may be necessary. To accomplish this, Canine Cathepsin S protein may be 
purified and partial amino acid sequence determined by automated sequenators. It is not 
necessary to determine the entire amino acid sequence, but the linear sequence of two regions of 
six to eight amino acids from the protein is determined for the production of primers for PCR 
amplification of a partial Canine Cathepsin S DNA fragment. 

Once suitable amino acid sequences have been identified, the DNA sequences capable of 
encoding them are synthesized. Because the genetic code is degenerate, more than one codon 
may be used to encode a particular amino acid, and therefore, the amino acid sequence can be 
encoded by any of a set of similar DNA oligonucleotides. Only one member of the set will be 
identical to the Canine Cathepsin S sequence but will be capable of hybridizing to Canine 
Cathepsin S DNA even in the presence of DNA oligonucleotides with mismatches. The 
mismatched DNA oligonucleotides may still sufficiently hybridize to the Canine Cathepsin S 
DNA to permit identification and isolation of Canine Cathepsin S encoding DNA. DNA isolated 
by these methods can be used to screen DNA libraries from a variety of cell types, from 
invertebrate and vertebrate sources, and to isolate homologous genes. 
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Purified biologically active Canine Cathepsin S may have several different physical forms. 
Canine Cathepsin S may exist as a full-length nascent or unprocessed polypeptide, or as partially 
processed polypeptides or combinations of processed polypeptides. The full-length nascent 
Canine Cathepsin S polypeptide may be posttranslationally modified by specific proteolytic 
cleavage events that result in the formation of fragments of the full length nascent polypeptide. 
A fragment, or physical association of fragments may have the full biological activity associated 
with Canine Cathepsin S however, the degree of Canine Cathepsin S activity may vary between 
individual Canine Cathepsin S fragments and physically associated Canine Cathepsin S 
polypeptide fragments. 

Because the genetic code is degenerate, more than one codon may be used to encode a particular 
amino acid, and therefore, the amino acid sequence can be encoded by any of a set of similar 
DNA oligonucleotides. Only one member of the set will be identical to the Canine Cathepsin S 
sequence but will be capable of hybridizing to Canine Cathepsin S DNA even in the presence of 
DNA oligonucleotides with mismatches under appropriate conditions. Under alternate 
conditions, the mismatched DNA oligonucleotides may still hybridize to the Canine Cathepsin S 
DNA to permit identification and isolation of Canine Cathepsin S encoding DNA. 

DNA encoding Canine Cathepsin S from a particular organism may be used to isolate and purify 
homologues of Canine Cathepsin S from other organisms. To accomplish this, the first Canine 
Cathepsin S DNA may be mixed with a sample containing DNA encoding homologues of Canine 
Cathepsin S under appropriate hybridization conditions. The hybridized DNA complex may be 
isolated and the DNA encoding the homologous DNA may be purified therefrom. 

Functional Derivatives/Variants 

It is known that there is a substantial amount of redundancy in the various codons that code for 
specific amino acids. Therefore, this invention is also directed to those DNA sequences that 
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contain alternative codons that code for the eventual translation of the identical amino acid. For 
purposes of this specification, a sequence bearing one or more replaced codons will be defined as 
a degenerate variation. Also included within the scope of this invention are mutations either in 
the DNA sequence or the translated protein, which do not substantially alter the ultimate physical 
properties of the expressed protein. For example, substitution of aliphatic amino acids alanine, 
valine, leucine and isoleucine; interchange of the hydroxyl residues serine and threonine, 
exchange of the acidic residues aspartic acid and glutamic acid, substitution between the amide 
residues asparagine and glutamine, exchange of the basic residues lysine and arginine and among 
the aromatic residues phenylalanine, tyrosine may not cause a change in functionality of the 
polypeptide. Such substitutions are well known and are described, for instance in Watson et al., 
Molecular Biology of the Gene, 4 th Edition, ( Bengamin Cummings Publishing Co.). 

It is known that DNA sequences coding for a peptide may be altered so as to code for a peptide 
having properties that are different than those of the naturally occurring peptide. Methods of 
altering the DNA sequences include, but are not limited to site directed mutagenesis, chimeric 
substitution, and gene fusions. Site-directed mutagenesis is used to change one or more DNA 
residues that may result in a silent mutation, a conservative mutation, or a nonconservative 
mutation. Chimeric genes are prepared by swapping domains of similar or different genes to 
replace similar domains in the canine Cathepsin S gene. Similarly, fusion genes may be 
prepared that add domains to the canine Cathepsin S gene, such as an affinity tag to facilitate 
identification and isolation of the gene. Fusion genes may be prepared to replace regions of the 
canine Cathepsin S gene, for example to create a soluble version of the protein by removing a 
transmembrane domain or adding a targeting sequence to redirect the normal transport of the 
protein, or adding new post-translational modification sequences to the canine Cathepsin S gene. 
Examples of altered properties include but are not limited to changes in the affinity of an enzyme 
for a substrate or a receptor for a ligand. All such changes of the polynucleotide or polypeptide 
sequences are anticipated as useful variants of the present invention so long as the original 



Page 11 



ORT-1458 



function of the polynucleotide or polypeptide sequence of the present invention is maintained as 
described herein. 

Identity or similarity, as known in the art, is relationships between two or more polypeptide 
sequences or two or more polynucleotide sequences, as determined by comparing the sequences. 
In the art, identity also means the degree of sequence relatedness between polypeptide or 
polynucleotide sequences, as the case may be, as determined by the match between strings of 
such sequences. Both identity and similarity can be readily calculated 

(Computational Molecular Biology , A. M. Lesk, (Ed.), (1988) Oxford University Press, New 
York; Biocomputing: Informatics and Genome Projects , D. W. Smith (Ed.), (1993) Academic 
Press, New York; Computer Analysis of Sequence Data, Part 1, A. M. Griffin and H. G. Griffin 
(Eds.) (1994) Human Press, New Jersey; Sequence Analysis in Molecular Biology , G. von 
Heinje, (1987) Academic Press; and Sequence Analysis Primer , M. Gribskov and J. Devereux, 
(Eds.) (1991) M. Stockton Press, New York. While there exist a number of methods to measure 
identity and similarity between two polynucleotide or two polypeptide sequences, both terms are 
well known to skilled artisans (Sequence Analysis in Molecular Biology , G. von Heinje (1987) 
Academic Press; Sequence Analysis Primer , M. Gribskov and J. Devereux (Eds.), (1991) M. 
Stockton Press, New York; and H. Carillo and D. Lipman, SIAM J. Applied Math. (1988) 
48:1073). Methods commonly employed to determine identity or similarity between sequences 
include, but are not limited to those disclosed in Carillo and Lipman, SIAM J. Applied Math. 
(1988) 48:1073. Preferred methods to determine identity are designed to give the largest match 
between the sequences tested. Methods to determine identity and similarity are codified in 
computer programs. Preferred computer program methods to determine identity and similarity 
between two sequences include, but are not limited to, GCG program package (Devereux et al., 
Nucleic Acids Research (1984) 12(1):387), BLASTP, BLASTN, and FASTA (Atschul et al., /. 
Molec. Biol (1990) 215:403). 
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Polynucleotide(s) generally refers to any polyribonucleotide or polydeoxyribonucleotide, which 
may be unmodified RNA or DNA or modified RNA or DNA. Thus, for instance, 
polynucleotides as used herein refers to, among others, single- and double-stranded DNA, DNA 
that is a mixture of single- and double-stranded regions or single-, double- and triple- stranded 
regions, single- and double-stranded RNA, and RNA that is mixture of single- and double- 
stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, 
more typically, double-stranded, or triple-stranded, or a mixture of single- and double- stranded 
regions. In addition, polynucleotide as used herein refers to triple-stranded regions comprising 
RNA or DNA or both RNA and DNA. The strands in such regions may be from the same 
molecule or from different molecules. The regions may include all of one or more of the 
molecules, but more typically involve only a region of some of the molecules. One of the 
molecules of a triple-helical region often is an oligonucleotide. As used herein, the term 
polynucleotide includes DNAs or RNAs as described above that contain one or more modified 
bases. Thus, DNAs or RNAs with backbones modified for stability or for other reasons are 
"polynucleotides" as that term is intended herein. Moreover, DNAs or RNAs comprising 
unusual bases, such as inosine, or modified bases, such as tritylated bases, to name just two 
examples, are polynucleotides as the term is used herein. It will be appreciated that a great 
variety of modifications have been made to DNA and RNA that serve many useful purposes 
known to those of skill in the art. The term polynucleotide as it is employed herein embraces 
such chemically, enzymatically or metabolically modified forms of polynucleotides, as well as 
the chemical forms of DNA and RNA characteristic of viruses and cells, including simple and 
complex cells, inter alia. Polynucleotides embraces short polynucleotides often referred to as 
oligonucleotide(s). 

The term polypeptide, as used herein, refers to the basic chemical structure of polypeptides that is 
well known and has been described in textbooks and other publications in the art. In this 
context, the term is used herein to refer to any peptide or protein comprising two or more amino 
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acids joined to each other in a linear chain by peptide bonds. As used herein, the term refers to 
both short chains, which also commonly are referred to in the art as peptides, oligopeptides and 
oligomers, for example, and to longer chains, which generally are referred to in the art as 
proteins, of which there are many types. It will be appreciated that polypeptides often contain 
amino acids other than the twenty amino acids commonly referred to as the twenty naturally 
occurring amino acids, and that many amino acids, including the terminal amino acids, may be 
modified in a given polypeptide, either by natural processes, such as processing and other post- 
translational modifications, but also by chemical modification techniques which are well known 
to the art. Even the common modifications that occur naturally in polypeptides are too numerous 
to list exhaustively here, but they are well described in basic texts and in more detailed 
monographs, as well as in a voluminous research literature, and they are well known to those of 
skill in the art. Among the known modifications which may be present in polypeptides of the 
present are, to name an illustrative few, acetylation, acylation, ADP- ribosylation, amidation, 
covalent attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent 
attachment of phosphotidylinositol, cross-linking, cyclization, disulfide bond formation, 
demethylation, formation of covalent cross-links, formation of cystine, formation of 
pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated 
addition of amino acids to proteins such as arginylation, and ubiquitination. Such modifications 
are well known to those of skill and have been described in great detail in the scientific literature. 
Several particularly common modifications, glycosylation, lipid attachment, sulfation, gamma- 
carboxylation of glutamic acid residues, hydroxylation and ADP-ribosylation, for instance, are 
described in most basic texts, such as, for instance Proteins - Structure And Molecular Properties, 
2nd Edition , (T. E. Creighton, W. H. Freeman and Company, New York (1993)). Many detailed 
reviews are available on this subject, such as, for example, those provided by F. Wold, 
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"Posttranslational Protein Modifications: Perspectives and Prospects", Posttranslational Covalent 
Modification Of Proteins , (B. C. Johnson, Ed., Academic Press, New York (1983)), pp. 1-12; 
Seifter et al., Meth. EnzymoL (1990) 182:626-646 and Rattan et al., "Protein Synthesis: 
Posttranslational Modifications and Aging", Ann. N.Y. Acad Set (1992) 663:48-62. It will be 
appreciated, as is well known and as noted above, that polypeptides are not always entirely linear. 
For instance, polypeptides may be generally as a result of posttranslational events, including 
natural processing event and events brought about by human manipulation that does not occur 
naturally. Circular, branched and branched circular polypeptides may be synthesized by non- 
translation natural process and by entirely synthetic methods, as well. Modifications can occur 
anywhere in a polypeptide, including the peptide backbone, the amino acid side-chains and the 
amino or carboxyl termini. In fact, blockage of the amino or carboxyl group in a polypeptide, or 
both, by a covalent modification, is common in naturally occurring and synthetic polypeptides 
and such modifications may be present in polypeptides of the present invention, as well. For 
instance, the amino terminal residue of polypeptides made in E. coli or other cells, prior to 
proteolytic processing, almost invariably will be N-formylmethionine. During post-translational 
modification of the peptide, a methionine residue at the NH.sub.2-terminus may be deleted. 
Accordingly, this invention contemplates the use of both the methionine-containing and the 
methionine-less amino terminal variants of the protein of the invention. The modifications that 
occur in a polypeptide often will be a function of how it is made. For polypeptides made by 
expressing a cloned gene in a host, for instance, the nature and extent of the modifications in 
large part will be determined by the host cell posttranslational modification capacity and the 
modification signals present in the polypeptide amino acid sequence. For instance, as is well 
known, glycosylation often does not occur in bacterial hosts such as, for example, E. coli . 
Accordingly, when glycosylation is desired, a polypeptide should be expressed in a glycosylating 
host, generally a eukaryotic cell. Insect cell often carry out the same posttranslational 
glycosylations as mammalian cells and, for this reason, insect cell expression systems have been 
developed to express efficiently mammalian proteins having native patterns of glycosylation, 
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inter alia. Similar considerations apply to other modifications. It will be appreciated that the 
same type of modification may be present in the same or varying degree at several sites in a given 
polypeptide. Also, a given polypeptide may contain many types of modifications. In general, as 
used herein, the term polypeptide encompasses all such modifications, particularly those that are 
present in polypeptides synthesized recombinantly by expressing a polynucleotide in a host cell. 

Variant(s) of polynucleotides or polypeptides, as the term is used herein, are polynucleotides or 
polypeptides that differ from a reference polynucleotide or polypeptide, respectively. A variant 
of the polynucleotide may be a naturally occurring variant such as a naturally occurring allelic 
variant, or it may be a variant that is not known to occur naturally. (1) A polynucleotide that 
differs in nucleotide sequence from another, reference polynucleotide. Generally, differences are 
limited so that the nucleotide sequences of the reference and the variant are closely similar 
overall and, in many regions, identical. As noted below, changes in the nucleotide sequence of 
the variant may be silent. That is, they may not alter the amino acids encoded by the 
polynucleotide. Where alterations are limited to silent changes of this type a variant will encode a 
polypeptide with the same amino acid sequence as the reference. Also as noted below, changes 
in the nucleotide sequence of the variant may alter the amino acid sequence of a polypeptide 
encoded by the reference polynucleotide. Such nucleotide changes may result in amino acid 
substitutions, additions, deletions, fusions and truncations in the polypeptide encoded by the 
reference sequence, as discussed above. (2) A polypeptide that differs in amino acid sequence 
from another, reference polypeptide. Generally, differences are limited so that the sequences of 
the reference and the variant are closely similar overall and, in many regions, identical. A 
variant and reference polypeptide may differ in amino acid sequence by one or more 
substitutions, additions, deletions, fusions and truncations, which may be present in any 
combination. As used herein, a "functional derivative" of Canine Cathepsin S is a compound 
that possesses a biological activity (either functional or structural) that is substantially similar to 
the biological activity of Canine Cathepsin S. The term "functional derivatives" is intended to 
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include the "fragments," "variants," "degenerate variants," "analogs" and "homologies" or to 
"chemical derivatives" of Canine Cathepsin S. Useful chemical derivatives of polypeptide are 
well known in the art and include, for example covalent modification of reactive organic site 
contained within the polypeptide with a secondary chemical moiety. Well known cross-linking 
reagents are useful to react to amino, carboxyl, or aldehyde residues to introduce, for example an 
affinity tag such as biotin, a fluorescent dye, or to conjugate the polypeptide to a solid phase 
surface (for example to create an affinity resin). The term "fragment" is meant to refer to any 
polypeptide subset of Canine Cathepsin S. A molecule is "substantially similar" to Canine 
Cathepsin S if both molecules have substantially similar structures or if both molecules possess 
similar biological activity. Therefore, if the two molecules possess substantially similar activity, 
they are considered to be variants even if the structure of one of the molecules is not found in the 
other or even if the two amino acid sequences are not identical. The term "analog" refers to a 
molecule substantially similar in function to either the entire Canine Cathepsin S molecule or to a 
fragment thereof. Further particularly preferred in this regard are polynucleotides encoding 
variants, analogs, derivatives and fragments of SEQ ID NO: 1, and variants, analogs and 
derivatives of the fragments, which have the amino acid sequence of the polypeptide of SEQ ID 
NO: 2 in which several, a few, 5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues are 
substituted, deleted or added, in any combination. Especially preferred among these are silent 
substitutions, additions and deletions, which do not alter the properties and activities of the gene 
of SEQ ID NO: 1. Also especially preferred in this regard are conservative substitutions. Most 
highly preferred are polynucleotides encoding polypeptides having the amino acid sequence of 
SEQ ID NO: 2 without substitutions. 

Further preferred embodiments of the invention are polynucleotides that are at least 70% 
identical over their entire length to a polynucleotide encoding the polypeptide having the amino 
acid sequence set out in SEQ ID NO: 2, and polynucleotides which are complementary to such 
polynucleotides. Alternatively, highly preferred are polynucleotides that comprise a region that 
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is at least 80% identical, more highly preferred are polynucleotides at comprise a region that is at 
least 90% identical, and among these preferred polynucleotides, those with at least 95% are 
especially preferred. Furthermore, those with at least 97% identity are highly preferred among 
those with at least 95%, and among these those with at least 98% and at least 99% are 
particularly highly preferred, with at least 99% being the most preferred. The polynucleotides 
which hybridize to the hereinabove described polynucleotides in a preferred embodiment encode 
polypeptides which retain substantially the same biological function or activity as the polypeptide 
characterized by the deduced amino acid sequence of SEQ ID NO: 2. Preferred embodiments in 
this respect, moreover, are polynucleotides that encode polypeptides that retain substantially the 
same biological function or activity as the mature polypeptide encoded by the DNA of SEQ ID 
NO: 1. The present invention further relates to polynucleotides that hybridize to the herein 
above-described sequences. In this regard, the present invention especially relates to 
polynucleotides that hybridize under stringent conditions to the herein above-described 
polynucleotides. As herein used, the term "stringent conditions" means hybridization will occur 
only if there is at least 95% and preferably at least 97% identity between the sequences. 

As discussed additionally herein regarding polynucleotide assays of the invention, for instance, 
polynucleotides of the invention may be used as a hybridization probe for RNA, cDNA and 
genomic DNA to isolate full-length cDNAs and genomic clones encoding the sequences of SEQ 
ID NO: 1 and to isolate cDNA and genomic clones of other genes that have a high sequence 
similarity to SEQ ID NO: 1. Such probes generally will comprise at least fifteen bases. 
Preferably, such probes will have at least thirty bases and may have at least 50 bases. Particularly 
preferred probes will have at least thirty bases, and have 50 bases or less. For example, the 
coding region of the gene of the present invention may be isolated using the known DNA 
sequence to synthesize an oligonucleotide probe. A labeled oligonucleotide having a sequence 
complementary to that of a gene of the present invention is then used to screen a library of 
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cDNA, genomic DNA or mRNA to determine to which members of the library the probe 
hybridizes. 

The polypeptides of the present invention include the polypeptide of SEQ ID NO: 2 (in particular 
the mature polypeptide) as well as polypeptides which have at least 70% identity to the 
polypeptide of SEQ ED NO: 2, preferably at least 80% identity to the polypeptide of SEQ ID 
NO: 2, and more preferably at least 90% similarity (more preferably at least 90% identity) to the 
polypeptide of SEQ ID NO: 2 and still more preferably at least 95% similarity (still more 
preferably at least 97% identity) to the polypeptide of SEQ ID NO: 2 and also include portions of 
such polypeptides with such portion of the polypeptide generally containing at least 30 amino 
acids and more preferably at least 50 amino acids. Representative examples of polypeptide 
fragments of the present invention, include, for example, truncation polypeptides of SEQ ID NO: 
2. Truncation polypeptides include polypeptides having the amino acid sequence of SEQ ID NO: 
2, or of variants or derivatives thereof, except for deletion of a continuous series of residues (that 
is, a continuous region, part or portion) that includes the amino terminus, or a continuous series 
of residues that includes the carboxyl terminus or, as in double truncation mutants, deletion of 
two continuous series of residues, one including the amino terminus and one including the 
carboxyl terminus. Also preferred in this aspect of the invention are fragments characterized by 
structural or functional attributes of the polypeptide characterized by the sequences of SEQ ID 
NO: 2. Preferred embodiments of the invention in this regard include fragments that comprise 
alpha-helix and alpha-helix forming regions, beta-sheet and beta-sheet-forming regions, turn and 
turn-forming regions, coil and coil-forming regions, hydrophilic regions, hydrophobic regions, 
alpha amphipathic regions, beta amphipathic regions, flexible regions, surface-forming regions, 
substrate binding region, high antigenic index regions of the polypeptide of the invention, and 
combinations of such fragments. Preferred regions are those that mediate activities of the 
polypeptides of the invention. Most highly preferred in this regard are fragments that have a 
chemical, biological or other activity of the response regulator polypeptide of the invention, 
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including those with a similar activity or an improved activity, or with a decreased undesirable 
activity. 

Recombinant Expression of Canine Cathepsin S 

The cloned Canine Cathepsin S DNA obtained through the methods described herein may be 
recombinantly expressed by molecular cloning into an expression vector containing a suitable 
promoter and other appropriate transcription regulatory elements, and transferred into prokaryotic 
or eukaryotic host cells to produce recombinant Canine Cathepsin S protein. Techniques for 
such manipulations are fully described in Maniatis et al, supra , and are well known in the art. 

Expression vectors are defined herein as DNA sequences that are required for the transcription of 
cloned copies of genes and the translation of their mRNAs in an appropriate host. Such vectors 
can be used to express eukaryotic genes in a variety of hosts such as bacteria including E. coli, 
bluegreen algae, plant cells, insect cells, fungal cells including yeast cells, and animal cells. 

Specifically designed vectors allow the shuttling of DNA between hosts such as bacteria-yeast or 
bacteria-animal cells or bacteria-fungal cells or bacteria-invertebrate cells. An appropriately 
constructed expression vector should contain: an origin of replication for autonomous replication 
in host cells, selectable markers, a limited number of useful restriction enzyme sites, a potential 
for high copy number, and active promoters. A promoter is defined as a DNA sequence that 
directs RNA polymerase to bind to DNA and initiate RNA synthesis. A strong promoter is one 
that causes mRNAs to be initiated at high frequency. Expression vectors may include, but are 
not limited to, cloning vectors, modified cloning vectors, specifically designed plasmids, or 
viruses. 

A variety of mammalian expression vectors may be used to express recombinant Canine 
Cathepsin S in mammalian cells. Commercially available mammalian expression vectors which 
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may be suitable for recombinant Canine Cathepsin S expression, include but are not limited to, 
pMAMneo (Clontech), pcDNA3 (Invitrogen), pMClneo (Stratagene), pXTl (Stratagene), pSG5 
(Stratagene), EBO-pSV2-neo (ATCC 37593) pBPV-l(8-2) (ATCC 37110), pdBPV- 
MMTneo(342-12) (ATCC 37224), pRSVgpt (ATCC 37199), pRSVneo (ATCC 37198), pSV2- 
dhfr (ATCC 37146), pUCTag (ATCC 37460), pEE12 (Cell Tech) and 1ZD35 (ATCC 37565). 

A variety of bacterial expression vectors may be used to express recombinant Canine Cathepsin S 
in bacterial cells. Commercially available bacterial expression vectors that may be suitable for 
recombinant Canine Cathepsin S expression include, but are not limited to pET vectors 
(Novagen) and pQE vectors (Qiagen). 

A variety of fungal cell expression vectors may be used to express recombinant Canine Cathepsin 
S in fungal cells such as yeast. Commercially available fungal cell expression vectors that may 
be suitable for recombinant Canine Cathepsin S expression include but are not limited to pYES2 
(Invitrogen) and Pichia expression vector (Invitrogen). 

A variety of insect cell expression vectors may be used to express recombinant Canine Cathepsin 
S in insect cells. Commercially available insect cell expression vectors which may be suitable 
for recombinant expression of Canine Cathepsin S include but are not limited to pBlueBacH 
(Invitrogen), pFastBac (GibcoBRL), pVL1392 (BD Biosciences), pAcUW51 (BD Biosciences), 
pVL1393 (BD Biosciences), and pAcHP2 (BD Biosciences). 

DNA encoding Canine Cathepsin S may be cloned into an expression vector for expression in a 
recombinant host cell. Recombinant host cells may be prokaryotic or eukaryotic, including but 
not limited to bacteria such as E. coli, fungal cells such as yeast, mammalian cells including but 
not limited to cell lines of human, bovine, porcine, monkey and rodent origin, and insect cells 
including but not limited to Drosophila and silkworm derived cell lines. Cell lines derived from 
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mammalian species which may be suitable and which are commercially available, include but are 
not limited to, CV-1 (ATCC CCL 70), COS-1 (ATCC CRL 1650), COS-7 (ATCC CRL 1651), 
CHO-K1 (ATCC CCL 61), 3T3 (ATCC CCL 92), NIH/3T3 (ATCC CRL 1658), HeLa (ATCC 
CCL 2), C127I (ATCC CRL 1616), BS-C-1 (ATCC CCL 26), MRC-5 (ATCC CCL 171), L- 
cells, andHEK-293 (ATCC CRL1573). 

The expression vector may be introduced into host cells via any one of a number of techniques 
including but not limited to transformation, transfection, protoplast fusion, lipofection, and 
electroporation. The expression vector-containing cells are clonally propagated and individually 
analyzed to determine whether they produce Canine Cathepsin S protein. Identification of 
Canine Cathepsin S expressing host cell clones may be done by several means, including but not 
limited to immunological reactivity with anti-Canine Cathepsin S antibodies, and the presence of 
host cell-associated Canine Cathepsin S activity. 

Expression of Canine Cathepsin S DNA may also be performed using in vitro produced synthetic 
mRNA. Synthetic mRNA or mRNA isolated from Canine Cathepsin S producing cells can be 
efficiently translated in various cell-free systems, including but not limited to wheat germ 
extracts and reticulocyte extracts, as well as efficiently translated in cell based systems, including 
but not limited to microinjection into frog oocytes, with microinjection into frog oocytes being 
generally preferred. 

To determine the Canine Cathepsin S DNA sequence(s) that yields optimal levels of Canine 
Cathepsin S activity and/or Canine Cathepsin S protein, Canine Cathepsin S DNA molecules 
including, but not limited to, the following can be constructed: the full-length open reading 
frame of the Canine Cathepsin S cDNA encoding the 37 kDa protein from approximately base 31 
to approximately base 1028 (these numbers correspond to first nucleotide of first methionine and 
last nucleotide before the first stop codon) and several constructs containing portions of the 
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cDNA encoding Canine Cathepsin S protein. All constructs can be designed to contain none, all 
or portions of the 5' or the 3' untranslated region of Canine Cathepsin S cDNA. Canine 
Cathepsin S activity and levels of protein expression can be determined following the 
introduction, both singly and in combination, of these constructs into appropriate host cells. 
Following determination of the Canine Cathepsin S DNA cassette yielding optimal expression in 
transient assays, this Canine Cathepsin S DNA construct is transferred to a variety of expression 
vectors, for expression in host cells including, but not limited to, mammalian cells, baculovirus- 
infected insect cells, R coli , and the yeast S. cerevisiae. 

Assay Methods for Canine Cathepsin S 

Methods for detecting Canine Cathepsin S activity may involve the direct measurement of 
Canine Cathepsin S activity in whole cells transfected with Canine Cathepsin S cDNA or oocytes 
injected with Canine Cathepsin S mRNA. Canine Cathepsin S activity is measured by specific 
ligand binding or biological characteristics of the host cells expressing Canine Cathepsin S DNA. 

Cell Based Assays 

The present invention provides a whole cell method to detect compound modulation of canine 
Cathepsin S. The method comprises the steps: 

1) contacting a compound, and a cell that contains functional canine 
Cathepsin S, and 

2) measuring a change in the cell in response to modified canine Cathepsin S 
function by the compound. 

The amount of time necessary for cellular contact with the compound is empirically determined, 
for example, by running a time course with a known canine Cathepsin S modulator and 
measuring cellular changes as a function of time. 
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The measurement means of the method of the present invention can be further defined by 
comparing a cell that has been exposed to a compound to an identical cell that has not been 
similarly expose to the compound. Alternatively two cells, one containing functional canine 
Cathepsin S and a second cell identical to the first, but lacking functional canine Cathepsin S 
could be both be contacted with the same compound and compared for differences between the 
two cells. This technique is also useful in establishing the background noise of these assays. One 
of average skill in the art will appreciate that these control mechanisms also allow easy selection 
of cellular changes that are responsive to modulation of functional canine Cathepsin S. 

The term "cell" refers to at least one cell, but includes a plurality of cells appropriate for the 
sensitivity of the detection method. Cells suitable for the present invention may be bacterial, 
yeast, or eukaryotic. 

The assay methods to determine compound modulation of functional canine Cathepsin S can be 
in conventional laboratory format or adapted for high throughput. The term "high throughput" 
refers to an assay design that allows easy analysis of multiple samples simultaneously, and 
capacity for robotic manipulation. Another desired feature of high throughput assays is an assay 
design that is optimized to reduce reagent usage, or minimize the number of manipulations in 
order to achieve the analysis desired. Examples of assay formats include 96- well or 3 84- well 
plates, levitating droplets, and "lab on a chip" microchannel chips used for liquid handling 
experiments. It is well known by those in the art that as miniaturization of plastic molds and 
liquid handling devices are advanced, or as improved assay devices are designed, that greater 
numbers of samples may be performed using the design of the present invention. 

The cellular changes suitable for the method of the present invention comprise directly 
measuring changes in the function or quantity of canine Cathepsin S, or by measuring 
downstream effects of canine Cathepsin S function, for example by measuring secondary 
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messenger concentrations or changes in transcription or by changes in protein levels of genes that 
are transcriptionally influenced by canine Cathepsin S, or by measuring phenotypic changes in 
the cell. Preferred measurement means include changes in the quantity of canine Cathepsin S 
protein, changes in the functional activity of canine Cathepsin S, changes in the quantity of 
mRNA, changes in intracellular protein, changes in cell surface protein, or secreted protein, or 
changes in Ca+2, cAMP or GTP concentration. Changes in the quantity or functional activity of 
canine Cathepsin S are described herein. Changes in the levels of mRNA are detected by reverse 
transcription polymerase chain reaction (RT-PCR) or by differential gene expression. 
Immunoaffinity, ligand affinity, or enzymatic measurement quantifies changes in levels of 
protein in host cells. Protein-specific affinity beads or specific antibodies are used to isolate for 
example 35 S-methionine labelled or unlabelled protein. Labelled protein is analyzed by SDS- 
PAGE. Unlabelled protein is detected by Western blotting, cell surface detection by fluorescent 
cell sorting, cell image analysis, ELISA or RIA employing specific antibodies. Where the protein 
is an enzyme, the induction of protein is monitored by cleavage of a fluorigenic or colorimetric 
substrate. 

A preferred detection means for secreted proteins that are enzymes such as proteases, would be 
fluorescent or colorimetric enzymatic assays. Fluorescent/luminescent/color substrates for 
alkaline phosphatase are commercially available and such assays are easily adaptable to high 
throughput multiwell plate screen format. Fluorescent energy transfer based assays are used for 
protease assays. Fluorophore and quencher molecules are incorporated into the two ends of the 
peptide substrate of the protease. Upon cleavage of the specific substrate, separation of the 
fluorophore and quencher allows the fluorescence to be detectable. When the secreted protein 
could be measure by radioactive methods, scintillation proximity technology could be used. The 
substrate of the protein of interest is immobilized either by coating or incorporation on a solid 
support that contains a fluorescent material. A radioactive molecule, brought in close proximity 
to the solid phase by enzyme reaction, causes the fluorescent material to become excited and emit 
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visible light. Emission of visible light forms the basis of detection of successful ligand/target 
interaction, and is measured by an appropriate monitoring device. An example of a scintillation 
proximity assay is disclosed in United States Patent No. 4,568,649, issued February 4, 1986. 
Materials for these types of assays are commercially available from Dupont NEN® (Boston, 
Massachusetts) under the trade name FlashPlate™. 

A preferred detection means where the endogenous gene results in phenotypic cellular structural 
changes is statistical image analysis the cellular morphology or intracellular phenotypic changes. 
For example, but not by way of limitation, and cell may change morphology such a rounding 
versus remaining flat against a surface, or may become growth-surface independent and thus 
resemble transformed cell phenotype well known in the art of tumor cell biology, or a cell may 
produce new outgrowths. Phenotypic changes that may occur intracellularly include cytoskeletal 
changes, alteration in the endoplasmic reticulum/Golgi complex in response to new gene 
transcription, or production of new vesicles. 

Where the endogenous gene encodes a soluble intracellular protein, changes in the endogenous 
gene may be measured by changes of the specific protein contained within the cell lysate. The 
soluble protein may be measured by the methods described herein. 

The present invention is also directed to methods for screening for compounds that modulate the 
expression of DNA or RNA encoding Canine Cathepsin S as well as the function of Canine 
Cathepsin S protein in vivo. Compounds may modulate by increasing or attenuating the 
expression of DNA or RNA encoding Canine Cathepsin S, or the function of Canine Cathepsin S 
protein. Compounds that modulate the expression of DNA or RNA encoding Canine Cathepsin 
S or the function of Canine Cathepsin S protein may be detected by a variety of assays. The 
assay may be a simple "yes/no" assay to determine whether there is a change in expression or 
function. The assay may be made quantitative by comparing the expression or function of a test 
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sample with the levels of expression or function in a standard sample. Modulators identified in 
this process are useful as therapeutic agents, and Canine Cathepsin S. 

Purification of Canine Cathepsin S Protein 

Following expression of Canine Cathepsin S in a recombinant host cell, Canine Cathepsin S 
protein may be recovered to provide purified Canine Cathepsin S in active form. Several Canine 
Cathepsin S purification procedures are available and suitable for use (Kirschke et aL, Biochem. 
J. (1986) 240:455-459. The cysteine proteinase from bovine lymphoid tissue is distinct from 
L , cathepsin (Kopitar et al., Eur. J. Biochem. (1996)). Folding and activation of human 
§ procathepsin S from inclusion bodies produced in Escherichia coli. (Bromme et al. 7. Biol 

".■SIX? 

M Chem. (1993) 268:4832-4838) Functional Expression of Human Cathepsin S in Saccharomyces 
IS cerevisiae. Bromme and McGrath (1996) Protein Science 5:789-791. High-level expression 
:ff and crystallization of recombinant human cathepsin S). As described above for purification of 
« Canine Cathepsin S from natural sources, recombinant Canine Cathepsin S may be purified from 
14, cell lysates and extracts, or from conditioned culture medium, by various combinations of, or 
S individual application of salt fractionation, ion exchange chromatography, size exclusion 
O chromatography, hydroxylapatite adsorption chromatography and hydrophobic interaction 

chromatography, lectin chromatography, antibody/ligand affinity chromatography or covalent 

chromatography using the reactive cysteine at the active site. 

Recombinant Canine Cathepsin S can be separated from other cellular proteins by use of an 
immunoaffinity column made with monoclonal or polyclonal antibodies specific for full length 
nascent Canine Cathepsin S, polypeptide fragments of Canine Cathepsin S or Canine Cathepsin S 
subunits. The affinity resin is then equilibrated in a suitable buffer, for example phosphate 
buffered saline (pH 7.3), and the cell culture supernatants or cell extracts containing Canine 
Cathepsin S or Canine Cathepsin S subunits are slowly passed through the column. The column 
is then washed with the buffer until the optical density (A 2 go) falls to background, then the 
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protein is eluted by changing the buffer condition, such as by lowering the pH using a buffer such 
as 0.23 M glycine-HCl (pH 2.6). The purified Canine Cathepsin S protein is then dialyzed 
against a suitable buffer such as phosphate buffered saline. 

Protein Based Assay 

The present invention provides an in vitro protein assay method to detect compound modulation 
of canine Cathepsin S protein activity. The method comprises the steps; 

1) contacting a compound, and function canine Cathepsin S protein, and 

2) measuring a change to canine Cathepsin S function by the compound. 

The amount of time necessary for cellular contact with the compound is empirically determined, 
for example, by running a time course with a known canine Cathepsin S modulator and 
measuring changes as a function of time. 

Methods for detecting compounds that modulate canine Cathepsin S proteolytic activity comprise 
combining a punitive modulating compound, functional canine Cathepsin S protein, and a 
suitable labeled substrate and monitoring an effect of the compound on the protease by changes 
in the amount of substrate either as a function of time or after a predefined period of time. 
Labeled substrates include, but are not limited to; substrate that is radiolabeled (Coolican et al. 
(1986). Biol Chem. 261:4170-6), fluorometric (Lonergan et al. (1995). Food Set 60:72-3, 
78; Twining (1984). Anal Biochem. 143:30-4) or colorimetric (Buroker-Kilgore and Wang 
(1993). Anal Biochem. 208:387-92). Radioisotopes useful for use in the present invention 
include those well known in the art, specifically 125 I, 131 I, 3 H, 14 C, 35 S, 32 P, and 33 P. 
Radioisotopes are introduced into the peptide by conventional means, such as iodination of a 
tyrosine residue, phosphorylation of a serine or threonine residue, or incorporation of tritium, 
carbon or sulfur utilizing radioactive amino acid precursors. Zymography following SDS 
polyacrylamide gel electrophoresis (Wadstroem and Smyth (1973). Sci. Tools 20:17-21), as well 
as by fluorescent resonance energy transfer (FRET)-based methods (Ng and Auld (1989). Anal 
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Biochem. 183:50-6) are also methods used to detect compounds that modulate canine Cathepsin 
S proteolytic activity. Compounds that are agonists will increase the rate of substrate 
degradation and will result in less remaining substrate as a function of time. Compounds that are 
antagonists will decrease the rate of substrate degradation and will result in greater remaining 
substrate as a function of time. 

A preferred assay format useful for the method of the present invention is a FRET based method 
using peptide substrates that contain a fluorescent donor with either a quencher or acceptor that 
are separated by a peptide sequence encoding the canine Cathepsin S cleavage site. A 
fluorescent donor is a fluorigenic compound that can adsorb energy and transfers a portion of the 
energy to another compound. Examples of fluorescent donors suitable for use in the present 
invention include, but are not limited to, coumarins, xanthene dyes such as fluoresceines, 
rhodols, and rhodamines, resorufins, cyanine dyes bimanes, acridines, isoindols, dansyl dyes, 
aminophthalic hydrazides such as luminol and isoluminol derivatives, aminophthalimides, 
aminonapthalimides, aminobenzofurans, aminoquinolines, dicanohydroquinones, and europium 
and terbium complexes and related compounds. A quencher reduces the emission from the 
fluorescent donor when it is appropriately proximally located to the donor, and do not generally 
re-emit the energy in the form of fluorescence. Examples of such moieties include indigos, 
bezoquinones, anthraquinones, azo compounds, nitro compounds, indoanilines, and di- and 
triphenylmethanes. A FRET method using a donor/quencher pair measures increased emission 
from the fluorescent donor as a function of canine Cathepsin S enzymatic activity upon the 
peptide substrate. Therefore a test compound that antagonizes canine Cathepsin S will generate 
an emission signal between two control samples - a low (basal) fluorescence from the FRET 
peptide alone and a higher fluorescence from the FRET peptide digested by the activity of 
enzymatically active canine Cathepsin S. An acceptor is a fluorescent molecule that adsorbs 
energy from the fluorescent donor and re-emits a portion of the energy as fluorescence. An 
acceptor is a specific type of quencher that enables a separate mechanism to measure canine 
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Cathepsin S proteolytic efficacy. Methods that utilize a donor/acceptor pair measure a decrease 
in acceptor emission as a function of canine Cathepsin S enzymatic activity upon the peptide 
substrate. Therefore a test compound that antagonizes canine Cathepsin S will generate an 
emission signal between two control samples - a higher basal fluorescence from the FRET 
peptide alone and a lower fluorescence from the FRET peptide digested by the activity of 
enzymatically active canine Cathepsin S. Examples of acceptor useful for methods of the present 
invention include, but are not limited to, coumarins, fluoresceines, rhodols, rhodamines, 
resorufins, cyanines, difuoroboradiazindacenes, and phthalcyanines. 

Production and Use of Antibodies that Bind to Canine Cathepsin S 

Monospecific antibodies to Canine Cathepsin S are purified from mammalian antisera containing 
antibodies reactive against Canine Cathepsin S or are prepared as monoclonal antibodies reactive 
with Canine Cathepsin S using the technique originally described by Kohler and Milstein, Nature 
256: 495-497 (1975). Immunological techniques are well known in the art and described in, for 
example, Antibodies: A laboratory manual published by Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, NY, ISBN 0879693142. Monospecific antibody as used herein is defined as 
a single antibody species or multiple antibody species with homogenous binding characteristics 
for Canine Cathepsin S. Homogenous binding as used herein refers to the ability of the antibody 
species to bind to a specific antigen or epitope, such as those associated with the Canine 
Cathepsin S, as described above. Canine Cathepsin S specific antibodies are raised by 
immunizing animals such as mice, rats, guinea pigs, rabbits, goats, horses and the like, with 
rabbits being preferred, with an appropriate concentration of Canine Cathepsin S either with or 
without an immune adjuvant. 

Pre-immune serum is collected prior to the first immunization. Each animal receives between 
about 0.001 mg and about 1000 mg of Canine Cathepsin S associated with an acceptable immune 
adjuvant. Such acceptable adjuvants include, but are not limited to, Freund's complete, Freund's 
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incomplete, alum-precipitate, water in oil emulsion containing Corvnebacterium parvum and 
tRNA. The initial immunization consists of Canine Cathepsin S in, preferably, Freund's 
complete adjuvant at multiple sites either subcutaneously (SC), intraperitoneally (IP) or both. 
Each animal is bled at regular intervals, preferably weekly, to determine antibody titer. The 
animals may or may not receive booster injections following the initial immunization. Those 
animals receiving booster injections are generally given an equal amount of the antigen in 
Freund's incomplete adjuvant by the same route. Booster injections are given at about three- 
week intervals until maximal titers are obtained. At about 7 days after each booster 
immunization or about weekly after a single immunization, the animals are bled, the serum 
collected, and aliquots are stored at about -20°C. 

Monoclonal antibodies (mAb) reactive with Canine Cathepsin S are prepared by immunizing 
inbred mice, preferably Balb/c, with Canine Cathepsin S. The mice are immunized by the IP or 
SC route with about 0.001 mg to about 1.0 mg, preferably about 0.1 mg, of Canine Cathepsin S 
in about 0.1 ml buffer or saline incorporated in an equal volume of an acceptable adjuvant, as 
discussed above. Freund's adjuvant is preferred, with Freund's complete adjuvant being used for 
the initial immunization and Freund's incomplete adjuvant used thereafter. The mice receive an 
initial immunization on day 0 and are rested for about 2 to about 30 weeks. Immunized mice are 
given one or more booster immunizations of about 0.001 to about 1.0 mg of Canine Cathepsin S 
in a buffer solution such as phosphate buffered saline by the intravenous (IV) route. 
Lymphocytes, from antibody positive mice, preferably splenic lymphocytes, are obtained by 
removing spleens from immunized mice by standard procedures known in the art. Hybridoma 
cells are produced by mixing the splenic lymphocytes with an appropriate fusion partner, 
preferably myeloma cells, under conditions that will allow the formation of stable hybridomas. 
Fusion partners may include, but are not limited to: mouse myelomas P3/NSl/Ag 4-1; MPC-11; 
S-194 and Sp2/0, with Sp2/0 being generally preferred. The antibody producing cells and 
myeloma cells are fused in polyethylene glycol, about 1000 mol. wt., at concentrations from 
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about 30% to about 50%. Fused hybridoma cells are selected by growth in hypoxanthine, 
thymidine and aminopterin supplemented Dulbecco's Modified Eagles Medium (DMEM) by 
procedures known in the art. Supernatant fluids are collected from growth positive wells on 
about days 14, 18, and 21 and are screened for antibody production by an immunoassay such as 
solid phase immunoradioassay (SPIRA) using Canine Cathepsin S as the antigen. The culture 
fluids are also tested in the Ouchterlony precipitation assay to determine the isotype of the mAb. 
Hybridoma cells from antibody positive wells are cloned by a technique such as the soft agar 
technique of MacPherson, Soft Agar Techniques, in Tissue Culture Methods and Applications, 
Kruse and Paterson, Eds., Academic Press, 1973 or by the technique of limited dilution. 

Monoclonal antibodies are produced in vivo by injection of pristane primed Balb/c mice, 
approximately 0.5 ml per mouse, with about 1 x 10 6 to about 6 x 10 6 hybridoma cells at least 
about 4 days after priming. Ascites fluid is collected at approximately 8-12 days after cell 
transfer and the monoclonal antibodies are purified by techniques known in the art. 

In vitro production of anti-Canine Cathepsin S mAb is carried out by growing the hybridoma in 
tissue culture media using techniques and materials that are well known in the art. High density 
in vitro cell culture may be conducted to produce large quantities of anti-canine Cathepsin S 
mAbs using hollow fiber culture techniques, air lift reactors, roller bottle, or spinner flasks 
culture techniques well known in the art. The mAb are purified by techniques known in the art. 

Antibody titers of ascites or hybridoma culture fluids are determined by various serological or 
immunological assays which include, but are not limited to, precipitation, passive agglutination, 
enzyme-linked immunosorbent antibody (ELISA) technique and radioimmunoassay (RIA) 
techniques. Similar assays are used to detect the presence of Canine Cathepsin S in body fluids 
or tissue and cell extracts. 
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It is readily apparent to those skilled in the art that the above-described methods for producing 
monospecific antibodies may be utilized to produce antibodies specific for Canine Cathepsin S 
polypeptide fragments, or full-length nascent Canine Cathepsin S polypeptide, or the individual 
Canine Cathepsin S subunits. Specifically, it is readily apparent to those skilled in the art that 
monospecific antibodies may be generated that are specific for only one Canine Cathepsin S 
subunit or the fully functional canine Cathepsin S protein. It is also apparent to those skilled in 
the art that monospecific antibodies may be generated that inhibit normal function of canine 
Cathepsin S protein. 

Canine Cathepsin S antibody affinity columns are made by adding the antibodies to a gel support 
such that the antibodies form covalent linkages with the gel bead support. Preferred covalent 
linkages are made through amine, aldehyde, or sulfhydryl residues contained on the antibody. 
Methods to generate aldehydes or free sulfhydryl groups on antibodies are well known in the art; 
amine groups are reactive with, for example, N-hydroxysuccinimide esters. 

Since there is a significant difference in both the nucleic acid and amino acid sequences between 
the cathepsin S of different species, it is possible that inhibitors can be found that only inhibit a 
subset of the species. For treating human disease the inhibitors must work in humans. However, 
during the discovery and development of such compounds it is necessary to test these inhibitors 
in different animal models. In general it is best to know if a given inhibitor is active against the 
enzyme of that species. There are several canine models that could be used to test human 
cathepsin S inhibitors and therefore canine cathepsin S can be used determine if a given set of 
inhibitors can be used in these models. Once a compound is shown to be an inhibitor of human 
cathepsin S it can then be tested against canine cathepsin S before it is used in a canine model of 
human disease. 
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Kit Compositions Containing Canine Cathepsin S Specific Reagents 

Kits containing Canine Cathepsin S DNA or RNA, antibodies to Canine Cathepsin S, or Canine 
Cathepsin S protein may be prepared. Such kits are used to detect DNA that hybridizes to 
Canine Cathepsin S DNA or to detect the presence of Canine Cathepsin S protein or peptide 
fragments in a sample. Such characterization is useful for a variety of purposes including but not 
limited to forensic analyses, diagnostic applications, and epidemiological studies. 

The DNA molecules, RNA molecules, recombinant protein and antibodies of the present 
invention may be used to screen and measure levels of Canine Cathepsin S DNA, Canine 
Cathepsin S RNA or Canine Cathepsin S protein. The recombinant proteins, DNA molecules, 
RNA molecules and antibodies lend themselves to the formulation of kits suitable for the 
detection and typing of Canine Cathepsin S. Such a kit would comprise a compartmentalized 
carrier suitable to hold in close confinement at least one container. The carrier would further 
comprise reagents such as recombinant Canine Cathepsin S protein or anti-Canine Cathepsin S 
antibodies suitable for detecting Canine Cathepsin S. The carrier may also contain a means for 
detection such as labeled antigen or enzyme substrates or the like. 

Gene Therapy 

Nucleotide sequences that are complementary to the Canine Cathepsin S encoding DNA 
sequence can be synthesized for antisense therapy. These antisense molecules may be DNA, 
stable derivatives of DNA such as phosphorothioates or methylphosphonates, RNA, stable 
derivatives of RNA such as 2-O-alkylRNA, or other Canine Cathepsin S antisense 
oligonucleotide mimetics. Canine Cathepsin S antisense molecules may be introduced into cells 
by microinjection, liposome encapsulation or by expression from vectors harboring the antisense 
sequence. Canine Cathepsin S antisense therapy may be particularly useful for the treatment of 
diseases where it is beneficial to reduce Canine Cathepsin S activity. 
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Canine Cathepsin S gene therapy may be used to introduce Canine Cathepsin S into the cells of 
target organisms. The Canine Cathepsin S gene can be ligated into viral vectors that mediate 
transfer of the Canine Cathepsin S DNA by infection of recipient host cells. Suitable viral 
vectors include retrovirus, adenovirus, adeno-associated virus, herpes virus, vaccinia virus, 
poliovirus and the like. Alternatively, Canine Cathepsin S DNA can be transferred into cells for 
gene therapy by non- viral techniques including receptor-mediated targeted DNA transfer using 
ligand-DNA conjugates or adenovirus-ligand-DNA conjugates, lipofection membrane fusion or 
direct microinjection. These procedures and variations thereof are suitable for ex vivo as well as 
8 1 in vivo Canine Cathepsin S gene therapy. Canine Cathepsin S gene therapy may be particularly 
O useful for the treatment of diseases where it is beneficial to elevate Canine Cathepsin S activity. 
|^ Protocols for molecular methodology of gene therapy suitable for use with the canine Cathepsin 
j^j S gene is described in Gene Therapy Protocols , edited by Paul D. Robbins, Human press, Totawa 
W NJ, 1996. 

jj, Pharmaceutical Compositions 

U Pharmaceutical^ useful compositions comprising Canine Cathepsin S DNA, Canine Cathepsin S 
O RNA, or Canine Cathepsin S protein, or modulators of Canine Cathepsin S receptor activity, may 
be formulated according to known methods such as by the admixture of a pharmaceutically 
acceptable carrier. Examples of such carriers and methods of formulation may be found in 
Remington's Pharmaceutical Sciences. To form a pharmaceutically acceptable composition 
suitable for effective administration, such compositions will contain an effective amount of the 
protein, DNA, RNA, or modulator. 

Therapeutic or diagnostic compositions of the invention are administered to a subject in amounts 
sufficient to treat or diagnose disorders in which modulation of Cathepsin S-related activity is 
indicated. The effective amount may vary according to a variety of factors such as the 
individual's condition, weight, sex and age. Other factors include the mode of administration. 
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The pharmaceutical compositions may be provided to the individual by a variety of routes such 
as subcutaneous, topical, oral and intramuscular. 

The term "chemical derivative" describes a molecule that contains additional chemical moieties 
that are not normally a part of the base molecule. Such moieties may improve the solubility, 
half-life, absorption, etc. of the base molecule. Alternatively the moieties may attenuate 
undesirable side effects of the base molecule or decrease the toxicity of the base molecule. 
Examples of such moieties are described in a variety of texts, such as Remington's 
Pharmaceutical Sciences. 

Compounds identified according to the methods disclosed herein may be used alone at 
appropriate dosages defined by routine testing in order to obtain optimal inhibition of the Canine 
Cathepsin S receptor or its activity while minimizing any potential toxicity. In addition, co- 
administration or sequential administration of other agents may be desirable. 

The present invention also has the objective of providing suitable topical, oral, systemic and 
parenteral pharmaceutical formulations for use in the novel methods of treatment of the present 
invention. The compositions containing compounds or modulators identified according to this 
invention as the active ingredient for use in the modulation of Canine Cathepsin S can be 
administered in a wide variety of therapeutic dosage forms in conventional vehicles for 
administration. For example, the compounds or modulators can be administered in such oral 
dosage forms as tablets, capsules (each including timed release and sustained release 
formulations), pills, powders, granules, elixirs, tinctures, solutions, suspensions, syrups and 
emulsions, or by injection. Likewise, they may also be administered in intravenous (both bolus 
and infusion), intraperitoneal, subcutaneous, topical with or without occlusion, or intramuscular 
form, all using forms well known to those of ordinary skill in the pharmaceutical arts. An 
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effective but non-toxic amount of the compound desired can be employed as a Canine Cathepsin 
S modulating agent. 

The daily dosage of the products may be varied over a wide range from 0.01 to 1,000 mg per 
patient, per day. For oral administration, the compositions are preferably provided in the form of 
scored or un-scored tablets containing 0.01, 0.05, 0.1, 0.5, 1.0, 2.5, 5.0, 10.0, 15.0, 25.0, and 50.0 
milligrams of the active ingredient for the symptomatic adjustment of the dosage to the patient to 
be treated. An effective amount of the drug is ordinarily supplied at a dosage level of from about 
0.0001 mg/kg to about 100 mg/kg of body weight per day. The range is more particularly from 
about 0.001 mg/kg to 10 mg/kg of body weight per day. The dosages of the Canine Cathepsin S 
receptor modulators are adjusted when combined to achieve desired effects. On the other hand, 
dosages of these various agents may be independently optimized and combined to achieve a 
synergistic result wherein the pathology is reduced more than it would be if either agent were 
used alone. 

Advantageously, compounds or modulators of the present invention may be administered in a 
single daily dose, or the total daily dosage may be administered in divided doses of two, three or 
four times daily. Furthermore, compounds or modulators for the present invention can be 
administered in intranasal form via topical use of suitable intranasal vehicles, or via transdermal 
routes, using those forms of transdermal skin patches well known to those of ordinary skill in 
that art. To be administered in the form of a transdermal delivery system, the dosage 
administration will, of course, be continuous rather than intermittent throughout the dosage 
regimen. 

For combination treatment with more than one active agent, where the active agents are in 
separate dosage formulations, the active agents can be administered concurrently, or they each 
can be administered at separately staggered times. 
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The dosage regimen utilizing the compounds or modulators of the present invention is selected in 
accordance with a variety of factors including type, species, age, weight, sex and medical 
condition of the patient; the severity of the condition to be treated; the route of administration; 
the renal and hepatic function of the patient; and the particular compound thereof employed. A 
physician or veterinarian of ordinary skill can readily determine and prescribe the effective 
amount of the drug required to prevent, counter or arrest the progress of the condition. Optimal 
precision in achieving concentrations of drug within the range that yields efficacy without 
toxicity requires a regimen based on the kinetics of the drug's availability to target sites. This 
involves a consideration of the distribution, equilibrium, and elimination of a drag. 

p In the methods of the present invention, the compounds or modulators herein described in detail 
jjj can form the active ingredient, and are typically administered in admixture with suitable 

In pharmaceutical diluents, excipients or carriers (collectively referred to herein as "carrier" 

O 

. ; materials) suitably selected with respect to the intended form of administration, that is, oral 

P tablets, capsules, elixirs, syrups and the like, and consistent with conventional pharmaceutical 

D practices. 

W 

For instance, for oral administration in the form of a tablet or capsule, the active drug component 
can be combined with an oral, non-toxic pharmaceutically acceptable inert carrier such as 
ethanol, glycerol, water and the like. Moreover, when desired or necessary, suitable binders, 
lubricants, disintegrating agents and coloring agents can also be incorporated into the mixture. 
Suitable binders include, without limitation, starch, gelatin, natural sugars such as glucose or 
beta-lactose, corn sweeteners, natural and synthetic gums such as acacia, tragacanth or sodium 
alginate, carboxymethylcellulose, polyethylene glycol, waxes and the like. Lubricants used in 
these dosage forms include, without limitation, sodium oleate, sodium stearate, magnesium 
stearate, sodium benzoate, sodium acetate, sodium chloride and the like. Disintegrators include, 
without limitation, starch, methylcellulose, agar, bentonite, xanthan gum and the like. 
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For liquid forms the active drug component can be combined in suitably flavored suspending or 
dispersing agents such as the synthetic and natural gums, for example, tragacanth, acacia, methyl- 
cellulose and the like. Other dispersing agents that may be employed include glycerin and the 
like. For parenteral administration, sterile suspensions and solutions are desired. Isotonic 
preparations, which generally contain suitable preservatives, are employed when intravenous 
administration is desired. 



Topical preparations containing the active drug component can be admixed with a variety of 
carrier materials well known in the art, such as, e.g., alcohols, aloe vera gel, allantoin, glycerine, 
vitamin A and E oils, mineral oil, PPG2 myristyl propionate, and the like, to form, e.g., alcoholic 
solutions, topical cleansers, cleansing creams, skin gels, skin lotions, and shampoos in cream or 
gel formulations. 



The compounds or modulators of the present invention can also be administered in the form of 
liposome delivery systems, such as small unilamellar vesicles, large unilamellar vesicles and 
multilamellar vesicles. Liposomes can be formed from a variety of phospholipids, such as 
cholesterol, stearylamine or phosphatidylcholines. 

Compounds of the present invention may also be delivered by the use of monoclonal antibodies 
as individual carriers to which the compound molecules are coupled. The compounds or 
modulators of the present invention may also be coupled with soluble polymers as targetable 
drug carriers. Such polymers can include polyvinyl-pyrrolidone, pyran copolymer, 
polyhydroxypropylmethacryl-amidephenol, polyhydroxy-ethylaspartamidephenol, or polyethyl- 
eneoxidepolylysine substituted with palmitoyl residues. Furthermore, the compounds or 
modulators of the present invention may be coupled to a class of biodegradable polymers useful 
in achieving controlled release of a drug, for example, polylactic acid, polyepsilon caprolactone, 
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polyhydroxy butyric acid, polyorthoesters, polyacetals, polydihydro-pyrans, polycyanoacrylates 
and cross-linked or amphipathic block copolymers of hydrogels. 

For oral administration, the compounds or modulators may be administered in capsule, tablet, or 
bolus form or alternatively they can be mixed in the animals feed. The capsules, tablets, and 
boluses are comprised of the active ingredient in combination with an appropriate carrier vehicle 
such as starch, talc, magnesium stearate, or di-calcium phosphate. These unit dosage forms are 
prepared by intimately mixing the active ingredient with suitable finely powdered inert 
ingredients including diluents, fillers, disintegrating agents, and/or binders such that a uniform 
mixture is obtained. An inert ingredient is one that will not react with the compounds or 
modulators and which is non-toxic to the animal being treated. Suitable inert ingredients include 
starch, lactose, talc, magnesium stearate, vegetable gums and oils, and the like. These 
formulations may contain a widely variable amount of the active and inactive ingredients 
depending on numerous factors such as the size and type of the animal species to be treated and 
the type and severity of the infection. The active ingredient may also be administered as an 
additive to the feed by simply mixing the compound with the feedstuff or by applying the 
compound to the surface of the feed. Alternatively the active ingredient may be mixed with an 
inert carrier and the resulting composition may then either be mixed with the feed or fed directly 
to the animal. Suitable inert carriers include corn meal, citrus meal, fermentation residues, soya 
grits, dried grains and the like. The active ingredients are intimately mixed with these inert 
carriers by grinding, stirring, milling, or tumbling such that the final composition contains from 
0.001 to 5% by weight of the active ingredient. 

The compounds or modulators may alternatively be administered parenterally via injection of a 
formulation consisting of the active ingredient dissolved in an inert liquid carrier. Injection may 
be either intramuscular, intraruminal, intratracheal, or subcutaneous. The injectable formulation 
consists of the active ingredient mixed with an appropriate inert liquid carrier. Acceptable liquid 
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carriers include the vegetable oils such as peanut oil, cottonseed oil, sesame oil and the like as 
well as organic solvents such as solketal, glycerol formal and the like. As an alternative, aqueous 
parenteral formulations may also be used. The vegetable oils are the preferred liquid carriers. 
The formulations are prepared by dissolving or suspending the active ingredient in the liquid 
carrier such that the final formulation contains from approximately 0.005 to about 10% by weight 
of the active ingredient. 

Topical application of the compounds or modulators is possible through the use of a liquid 
drench or a shampoo containing the instant compounds or modulators as an aqueous solution or 
suspension. These formulations generally contain a suspending agent such as bentonite and 
normally will also contain an antifoaming agent. Formulations containing from 0.005 to 10% by 
weight of the active ingredient are acceptable. Preferred formulations are those containing from 
0.01 to 5% by weight of the instant compounds or modulators. 

The following examples illustrate the present invention without, however, limiting the same 
thereto. 

EXAMPLE 1 
CLONING OF CANINE CATHEPSIN S cDNA 

A canine blood cell library was constructed from poly (A)+ selected RNA using standard 
procedures known in the art. Briefly, mRNA from lOxlO 6 cells was purified on oligo(dT)- 
cellulose according to the manufacturers instructions (Invitrogen, San Diego, CA). 0.5 |Lig 
mRNA was used to synthesize and amplify double-stranded cDNA (SMART cDNA synthesis 
kit, Clonetech, Palo Alto, CA). The resulting product was size-selected using a 0.8% low- 
melting agarose gel. cDNA in the range 0.6-5 kb was subsequently ligated into PCR 2.1 using 
TOPO TA cloning (Invitrogen) and transformed into E. coli. 
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A 301 bp EcoRI-PflMI fragment from human cathepsin S was used to screen the plated library 
after labelling with 32 P-dCTP using random priming (Stratagene, San Diego, CA). Library filters 
were prehybridized and hybridized according to standard protocols (buffer from Sigma). After 
hybridization the filters were washed twice at room temperature in 2X standard saline citrate 
(SSC)/0.2%SDS for fifteen minutes followed by two washes at 50°C in 0.2XSSC/0.1%SDS for 
fifteen minutes. Filters were exposed to film and developed. DNA prepared from positive 
clones was subsequently sequenced using standard protocols and equipment (ABI 377, Perkin- 
Elmer, Norwalk, CT). Based on the full-length sequence, 5' (5 TTATTG AATTCGCC ACC AT 
u GAAATGGCTAGTTGGGCTGCTT) SEQ ID NO: 3, and 3' (5TTTGTCGACCT 
S AGATTTCTGG GTAAG) SEQ ID NO: 4, primers were constructed, incorporating into Eco RI 
jl and Sal I sites respectively. These primers were used to amplify a full length PCR product from 
jjj the SMART first strand product. After digestion with Eco RI and Sal I the fragment was ligated 
ttj into Eco RI and Sal I digested pFastbac (GibcoBRL, Rockville, MD). 

IsseP 
3: 

jjasjs 

2 EXAMPLE 2 

O PRIMARY STRUCTURE OF THE CANINE CATHEPSIN S PROTEIN 

The nucleotide sequences of pCanine Cathepsin S revealed single large open reading frame of 
about Canine Cathepsin S base pairs. The cDNAs have 5' and 3'-untranslated extensions of 
about 35 and about 96 nucleotides for Canine Cathepsin S. The first in-frame methionine was 
designated as the initiation codon for an open reading frame that predicts a Canine Cathepsin S 
protein with an estimated molecular mass (M r ) of about 37 kDa. The protein contained 
hydrophobic amino-terminal residues with sequences highly predictive of signal cleavage sites 
and a propeptide that is removed to result in predicted mature protein initiating at amino acid 
115. 
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The predicted Canine Cathepsin S protein was aligned with the Cathepsin S sequences from 
other species that are known (human, mouse, rat and bovine mature protein only). There is an 
87, 74 and 75% amino acid identity versus human, mouse and rat cathepsin S, respectively, and 
91, 82 and 80% conserved. When only the mature protein is considered the amino acid identities 
become 90, 84, 84 and 90% and 94, 89, 87 and 94% conserved versus human, mouse, rat and 
bovine. The canine Cathepsin S sequence contains active site residues Cys at position 139, His 
at position 278 and Asn at position 298 which are conserved amongst all of the species whose 
sequence is known. 

fall 

0 EXAMPLE 3 

C CLONING OF THE CANINE CATHEPSIN S cDNA INTO 

S E. COLI EXPRESSION VECTORS 

Vff 

„~ Recombinant Canine Cathepsin S is produced in R coH following the transfer of the Canine 
u, Cathe P sin s expression cassette into E. coh expression vectors, including but not limited to, the 

g pET series (Novagen). The pET vectors place Canine Cathepsin S expression under control of 

IB 

q the tightly regulated bacteriophage T7 promoter. Following transfer of this construct into an E. 

s ~" coh host that contains a chromosomal copy of the T7 RNA polymerase gene driven by the 
inducible lac promoter, expression of Canine Cathepsin S is induced when an appropriate lac 
substrate (IPTG) is added to the culture. The levels of expressed Canine Cathepsin S are 
determined by the assays described herein. 

The cDNA encoding the entire open reading frame for Canine Cathepsin S is inserted into the 
Ndel site of pET [16 ]lla. Constructs in the positive orientation are identified by sequence 
analysis and used to transform the expression host strain BL21. Transformants are then used to 
inoculate cultures for the production of Canine Cathepsin S protein. Cultures may be grown in 
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M9 or ZB media, whose formulation is known to those skilled in the art. After growth to an 
OD 600= 15 ' expression of Canine Cathepsin S is induced with 1 mM IPTG for 3 hours at 37°C. 

EXAMPLE 4 

CLONING OF CANINE CATHEPSIN S cDNA INTO A 
MAMMALIAN EXPRESSION VECTOR 

The Canine Cathepsin S cDNAs were cloned into the mammalian expression vectors pMAMneo 
and pcDNA3. The Canine Cathepsin S Bluescript plasmids were digested with Not I and treated 
O with Klenow enzyme to create a blunt cloning end. The inserts were excised with Sal I digestion 
p and purified by agarose gel electrophoresis. The pMAMneo vector was treated with Xhol , 
7Z Klenow enzyme and then Sail and calf intestinal phosphatase (CIP). The linear vector was 
m purified on agarose gel and used to ligate to the Canine Cathepsin S cDNA inserts. 
,7 Recombinants were isolated, designated Canine Cathepsin S, and used to transfect mammalian 
Q cells (L-cells) by CaPO^DNA precipitation. Stable cell clones were selected by growth in the 
12 P resenc e of G418. Single G418 resistant clones were isolated and shown to contain the intact 
Q Canine Cathepsin S gene. Clones containing the Canine Cathepsin S cDNAs are analyzed for 
■ expression using immunological techniques, such as immunoprecipitation, Western blot, and 
immunofluorescence using antibodies specific to the Canine Cathepsin S proteins. Antibody is 
obtained from rabbits inoculated with peptides that are synthesized from the amino acid sequence 
predicted from the Canine Cathepsin S sequences. 

The Canine Cathepsin S genes were inserted into pcDNA3. Canine Cathepsin S was digested 
with Xhol and NotI and the cDNA inserts isolated by agarose gel electrophoresis. The vector, 
pcDNA3, was digested with Xhol and NotI, treated with CIP and the linear vector isolated by gel 
electrophoresis, and ligated with cDNA inserts. Recombinant plasmids Canine Cathepsin S were 
used to transform the mammalian COS or CHO cells. 
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Cells that are expressing Canine Cathepsin S, stably or transiently, are used to test for expression 
of the protease. These cells are used to identify and examine other compounds for their ability to 
modulate, inhibit or activate the protease and to compete for labelled substrate. 

Cassettes containing the Canine Cathepsin S cDNA in the positive orientation with respect to the 
promoter are ligated into appropriate restriction sites 3' of the promoter and identified by 
restriction site mapping and/or sequencing. These cDNA expression vectors are introduced into 
fibroblastic host cells for example COS-7 (ATCC# CRL1651), and CV-1 tat [Sackevitz et al., 
u Science 238: 1575 (1987)], 293, L (ATCC# CRL6362)] by standard methods including but not 
Q limited to electroporation, or chemical procedures (cationic liposomes, DEAE dextran, calcium 
C phosphate). Transfected cells and cell culture supernatants can be harvested and analyzed for 
Canine Cathepsin S expression as described herein. 

(Xj 
ti 

All of the vectors used for mammalian transient expression can be used to establish stable cell 
H lines expressing Canine Cathepsin S. Unaltered Canine Cathepsin S cDNA constructs cloned 
Q into expression vectors are expected to program host cells to make Canine Cathepsin S protein. 
O In addition, Canine Cathepsin S is expressed extracellularly as a secreted protein by ligating 

Canine Cathepsin S cDNA constructs to DNA encoding the signal sequence of a secreted protein. 

The transfection host cells include, but are not limited to, CV-l-P [Sackevitz et aL, Science 238: 

1575 (1987)], tk-L [Wigler, et aL Cell 11: 223 (1977)], NS/0, and dHFr- CHO [Kaufman and 

Sharp, J. Mol. Biol. 159: 601, (1982)]. 

Co-transfection of any vector containing Canine Cathepsin S cDNA with a drug selection 
plasmid including, but not limited to G418, aminoglycoside phosphotransferase; hygromycin, 
hygromycin-B phosphotransferase; APRT, xanthine-guanine phosphoribosyl-transferase, will 
allow for the selection of stably transfected clones. Levels of Canine Cathepsin S are quantitated 
by the assays described herein. 
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Canine Cathepsin S cDNA constructs are also ligated into vectors containing amplifiable drug- 
resistance markers for the production of mammalian cell clones synthesizing the highest possible 
levels of Canine Cathepsin S. Following introduction of these constructs into cells, clones 
containing the plasmid are selected with the appropriate agent, and isolation of an over- 
expressing clone with a high copy number of plasmids is accomplished by selection in increasing 
doses of the agent. 

The expression of recombinant Canine Cathepsin S is achieved by transfection of full-length 
Canine Cathepsin S cDNA into a mammalian host cell. 

[1 EXAMPLE 5 

m CLONING OF CANINE CATHEPSIN S cDNA INTO A BACULOVIRUS EXPRESSION 

Q 

i, VECTOR FOR EXPRESSION IN INSECT CELLS 

!=& 

Q Baculovirus vectors, which are derived from the genome of the AcNPV virus, are designed to 
p provide high-level expression of cDNA in the Sf9 line of insect cells (ATCC CRL# 1711). Sf9 
5 insect cells and SFM IT medium were purchased from GibcoBRL. Bacmids, Sf9 cell 
transfections and subsequent infections were made according to the manufacturers protocol 
(GibcoBRL). Primary and secondary virus stocks were allowed to totally lyse. Sf9 cells at 2.5- 
3xl0 6 were infected at an MOI of 1 and allowed to totally lyse. Supernatants were removed and 
further processed. Canine Cathepsin S was found secreted into the culture medium of these cells. 
The production of active canine cathepsin s from the cells is shown in Figure 3. 
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EXAMPLE 6 

CLONING OF CANINE CATHEPSIN S cDNA INTO A 
YEAST EXPRESSION VECTOR 

Recombinant Canine Cathepsin S is produced in the yeast S. cerevisiae following the insertion of 
the optimal Canine Cathepsin S cDNA cistron into expression vectors designed to direct the 
intracellular or extracellular expression of heterologous proteins. In the case of intracellular 
5 expression, vectors such as EmBLyex4 or the like are ligated to the Canine Cathepsin S cistron 
Q [Rinas, U. et al, Biotechnology 8: 543-545 (1990); Horowitz B. et al., J. Biol. Chem. 265: 4189- 
jl 4192 (1989)]. For extracellular expression, the Canine Cathepsin S cistron is ligated into yeast 

.JSSS. 

" expression vectors that fuse a secretion signal (a yeast or mammalian peptide) to the NH 2 
19 terminus of the Canine Cathepsin S protein [Jacobson, M. A., Gene 85: 511-516 (1989); Riett L. 
l~ and Bellon N. Biochem. 28: 2941-2949 (1989)]. 

5 These vectors include, but are not limited to pAVEl>6, which fuses the human serum albumin 
p signal to the expressed cDNA [Steep O. Biotechnology 8: 42-46 (1990)], and the vector pL8PL 
r * which fuses the human lysozyme signal to the expressed cDNA [Yamamoto, Y., Biochem. 28: 
2728-2732)]. In addition, Canine Cathepsin S is expressed in yeast as a fusion protein 
conjugated to ubiquitin utilizing the vector pVEP [Ecker, D. J., J. Biol. Chem. 264: 7715-7719 
(1989), Sabin, E. A., Biotechnology 7: 705-709 (1989), McDonnell D. P., Mol. Cell Biol. 9: 
5517-5523 (1989)]. The levels of expressed Canine Cathepsin S are determined by the assays 
described herein. 
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EXAMPLE 7 

PURIFICATION OF RECOMBINANT CANINE CATHEPSIN S 

Recombinantly produced Canine Cathepsin S may be purified by antibody affinity 
chromatography. Canine Cathepsin S antibody affinity columns are made by adding the anti- 
Canine Cathepsin S antibodies to Affigel-10 (Biorad), a gel support that is pre-activated with N- 
hydroxysuccinimide esters such that the antibodies form covalent linkages with the agarose gel 
bead support. The antibodies are then coupled to the gel via amide bonds with the spacer arm. 
_ The remaining activated esters are then quenched with 1M ethanolamine HC1 (pH 8). The 

O column is washed with water followed by 0.23 M glycine HC1 (pH 2.6) to remove any non- 
113 

jjh conjugated antibody or extraneous protein. The column is then equilibrated in phosphate 

fi buffered saline (pH 7.3) together with appropriate membrane solubilizing agents such as 

IS detergents and the cell culture supernatants or cell extracts containing solubilized Canine 

Q 

„ Cathepsin S are slowly passed through the column. The column is then washed with phosphate- 

r buffered saline together with detergents until the optical density (A280) falls to background, then 

Q the protein is eluted with 0.23 M glycine-HCl (pH 2.6) together with detergents. The purified 

p Canine Cathepsin S protein is then dialyzed against 10 mM sodium acetate pH 4.5 buffer. 

EXAMPLE 8 

PURIFICATION OF RECOMBINANT CANINE CATHEPSIN S 
USING THIOPROPYL SEPHAROSE 

Recombinantly produced Canine Cathepsin S may be purified by thiopropyl Sepharose 
chromatography. 
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Canine Cathepsin S was purified based on the published procedure for the purification of human 
cathepsin S [Vernet, 1993; McGrath, 1996]. In general 2 L of supernatant from infected Sf9 cells 
was adjusted to pH 4.5 with 100 mM sodium acetate and incubated overnight at 4 °C. 
Afterwards 7g of thiopropyl Sepharose 6B (Pharmacia, Uppsala, Sweden) was added and the 
solution was stirred overnight at 4 °C. Washes and elutions were done in a batch mode and the 
filtrates were collected by filtration with a sintered glass filter. The Sepharose was washed two 
times with 250 mL of 100 mM sodium acetate pH 4.5, 1 mM EDTA and then two times with 250 
mL of 100 mM Tris-HCl pH 8.0, 1 mM EDTA. The cathepsin S is then eluted by adding 100 
mL of 100 mM MES pH 6.5 containing 1 mM EDTA, 10 mM DTT and 50 mM cysteine and 
stirring at room temperature for 1 hour. Elution is repeated three times to remove all of the 
bound cathepsin S. The activity is monitored using 20 |LiM Z-VVR-AMC as the substrate. The 
active fractions were pooled and dialyzed overnight against 10 mM sodium acetate pH 4.5 at 4 
°C. The dialyzed sample was adjusted to 25% glycerol and frozen at -80 °C. 



EXAMPLE 9 

ASSAY FOR INHIBITORS OF CANINE CATHEPSIN S 



Inhibitors of Canine Cathepsin S can be assayed using a fluorescent-based protease assay. All 
assays were carried out in a buffer consisting of 100 mM sodium acetate pH 5.0 containing 100 
mM NaCl and 1 .5 mM DTT. The substrate Z-VVR-AMC was used at a final concentration of 20 
jiiM. The volume was 100 ^iL in a microtiter plate and the increase in fluorescence was read on a 
CytoFluor II (Perseptive Biosystems, Framingham, MA) use an excitation filter of 360/ nm and 
an emission filter of 460/ nm. The initial rate of product formation as judged by the increase in 
fluorescence intensity as a function of time was measured using linear regression. Inhibitors of 
canine cathepsin S lead to a decrease in this initial rate. The results are shown in Figure 4. 
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